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METHOD FOR IMPROVING THE QUALITY OF 
MS ACOUSTIC SIGNAL 

BACKGROUND 

The present disclosure relates to systems and methods for 
measuring and monitoring the quality of a speech signal and 
providing the system and user with corrective action 
suggestions . 

In the field of human-machine speech interface, automatic 
speech recognition, and voiced telecommunication, the quality 
of speech data is often degraded by a number of factors. The 
degradation factors include improper placement of microphone, 
improper amplifier gain, microphone being turned off 
unknowingly, speaker voice quality and level, or noise 
interference. This results in system performance degradation 
and unsatisfactory user experience. 

The prior art systems attempt to control the on/off state 
of the microphone using a hardware switch, often under control 
of the user. However, information about the on/off state of 
the microphone often may not get passed on to the rest of the 
system. This oversight may result in system failure and user 
frustration. Further, the prior art systems fail to take into 
consideration the difference between noise and signal, and 
therefore attempt to control the microphone gain based on the 
amplitude of the noise and signal. 
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SUMMARY 

The present disclosure includes methods, systems, and 
computer programs to continuously and automatically monitor 
the quality of an acoustic signal and provide feedback to the 
system or user for corrective actions. The input signal may 
represent human speech, but it should be recognized that the 
system may be used to monitor any type of acoustic data, such 
as musical instruments. 

The preferred embodiment of the invention monitors input 
data as follows. An input signal is digitized into binary 
data. The digitized time series is analyzed to determine if 
the microphone is on or off. If the microphone is deemed to 
be in a different state than that expected by the system, a 
message is provided to the user or system suggesting a 
corrective action, such as turning the microphone on. The 
system can also take internal actions, such as refrain from 
adjusting to the data, since it does not correspond to 
acoustic data. 

The acoustic data is then transformed to the frequency 
domain. The data is analyzed in the frequency domain to 
measure its quality, such as signal-to-noise ratio. If the 
quality of the data is poor, a message is passed on to the 
user or system suggesting a corrective action. 
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The quality of the data is continuously analyzed so that 
even if the quality is good at the beginning but degrades 
later on, the degradation is still detected and acted upon. 
This continuous and automatic monitoring of data quality and 
the ensuing user feedback provides the user with an overall 
more satisfying experience than would otherwise occur. 

The details of one or more embodiments of the invention 
are set forth in the accompanying drawings and the description 
below. Other features, objects, and advantages of the 
invention will be apparent from the description and drawings, 
and from the claims. 



DESCRIPTION OF DRAWINGS 

FIG. 1 is a block diagram of a programmable processing 
system in accordance with an embodiment of the present 
invention. 

FIG. 2 is a block diagram of an acoustic signal 
monitoring system according to an embodiment of the present 
invention . 

FIG. 3 is a method for monitoring an acoustic signal in 
accordance with an embodiment of the present invention. 

FIG. 4 is a flowchart of an acoustic time series analysis 
in accordance with an embodiment of the present invention. 
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FIG. 5 is a flowchart of a joint time series and spectrum 
analysis according to an embodiment of the present invention. 

Like reference numbers and designations in the various 
drawings indicate like elements. 



DETAILED DESCRIPTION 

Throughout this description, the embodiments and examples 
shown should be considered as exemplars rather than as 

C3 

^3 limitations of the invention. 

Ul 

=vj 10 The inventors recognized that it would be desirable to 

01 

gi have a monitoring system that enables automatic and continuous 

monitoring of a speech signal quality. The monitored data may 
be used to determine which factors are responsible for non- 
optimum quality. The monitoring system may supply the user or 
15 audio system with appropriate feedback for corrective actions. 
The present disclosure also provides a method for enabling 
such a monitoring system. 

FIG. 1 shows a block diagram of a programmable processing 
system 100 in accordance with an embodiment of the present 
20 invention. The processing system 100 may be used for 

implementing an acoustic signal monitoring system 108. In one 
embodiment, the processing system 100 also includes a 
processor 110, memory 112, a display controller, and a user 
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display 118. The user display 118 may be a system that 
provides corrective actions as a video or audio feedback. 

In the illustrated embodiment, an acoustic signal is 
received at a transducer microphone 102. The transducer 
microphone 102 generates corresponding electrical signal 
representation of the acoustic signal. An amplifier 104 may 
amplify the electrical signal from the transducer microphone 
102. The amplified signal may then be converted to a digital 
signal by an A-to-D converter 106. 

The output of the A-to-D converter 106 is applied to the 
processing system 100. The processing system 100 may include 
a CPU 110, memory 112, and a storage device 114, coupled to a 
CPU bus as shown. The memory 112 may include writable memory 
such as a flash ROM. The storage device 114 may be any 
storage device, such as a magnetic disk, that enables storage 
of data. 

The acoustic signal monitoring system 108 performs below- 
described monitoring and classification techniques to the 
acoustic signal. The status and output of the acoustic signal 
monitoring system 108 may be displayed for the benefit of a 
human user by means of a display controller 116. The display 
controller 116 drives a display 118, such as a video or sound 
display. The output may also be used by the audio system to 
adjust its parameters, such as amplifier gain. 
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A block diagram of the acoustic signal monitoring system 
108 according to an embodiment of the present invention is 
shown in FIG. 2. The monitoring system 108 includes a time- 
series analyzer 200, a frequency transform 202, and a 
parameter adjustment element 204. 

The time-series analyzer 200 performs detection of the 
microphone's on/off state. The analyzer 2 00 may also monitor 
and control the overall gain of an audio system. In some 
embodiments, the time-series analyzer 200 adjusts amplifier 
gains to substantially reduce clipping or overloading of the 
amplifier. In other embodiments, the time-series analyzer 200 
monitors and reports these undesirable conditions to the user 
and/or the audio system. 

The frequency transform 2 02 performs transformation of 
incoming acoustic signal into frequency domain for signal 
analysis in the frequency domain. The transformed signal is 
then directed to the parameter adjustment element 204. The 
parameter adjustment element 204 is a joint analysis of the 
time series and the spectrum. The element 204 performs 
detection of the microphone position with respect to the audio 
source. For example, the microphone may be positioned too 
close to the mouth airflow direction causing "puffing" sound. 

In another example, the microphone may be too far away from 
the audio source having poor signal-to-noise ratio. A report 
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may be generated as an output to report these undesirable 
conditions to the user suggesting a list of corrective actions 
appropriate to the situation. 

FIG. 3 is a method for monitoring an acoustic signal in 
5 accordance with an embodiment of the present invention. The 
incoming acoustic signal includes a plurality of data samples 
generated as output from the A-to-D converter. 

The incoming data stream is read into a computer memory 
as a set of samples at 300. In some embodiments, the method 

CJ 

^3 10 is applied to enhance a "moving window" of data representing 

U1 

'^'4 portions of a continuous acoustic data stream until the entire 

SI 

yl data stream is processed. Generally, an acoustic data stream 

01 to be enhanced is represented as a series of data "buffers" of 

P fixed length, regardless of the duration of the original 

U1 

15 acoustic data stream. 

Ft S 
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p At 302, an analysis of the acoustic time series is 

'""''^ performed on the sampled data stream. The analysis enables 

detection of the microphone's on/off state. The analysis also 

enables adjustment of overall gains to prevent clipping or 
20 overloading of the amplifier. If any one of these conditions 

occurs, a message is provided to the user and the audio system 

at 304. 

A frequency domain transformation is performed at 306 to 
enable frequency domain analysis. Gain adjustment is 
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performed at 308 based on frequency domain analysis of the 
acoustic signal-to-noise ratio. The frequency domain analysis 
allows detection of improper placement of the microphone with 
respect to the audio source. If undesirable placement of the 
microphone is detected, a message is sent to the user at 310 
suggesting a list of corrective actions appropriate to the 
situation. If end of data is detected at 312, the process 
terminates. Otherwise, the above steps are repeated for next 
stream of data. 

A flowchart of an acoustic time series analysis is shown 
in FIG. 4 in accordance with an embodiment of the present 
invention. At 400, an acoustic signal is analyzed in time 
domain to perform detection of signal clipping. If the signal 
is clipped, the gain of the amplifier is adjusted at 402. At 
404, a DC offset is calculated. The calculated DC offset may 
then be adjusted at 406. At 408, a root -mean- squared (RMS) 
value of the acoustic signal may be calculated to determine 
the on/off state of the microphone. 

The determination of the on/off state involves comparing 
the RMS value of the data with a threshold at 410. The 
threshold value may be adjusted for each system in a separate 
calibration phase. If the RMS value is below the threshold, a 
message is sent to both the user display and the client system 
at 412. The message informs the user and the client system 
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that the microphone is turned off at the present. The client 
system includes an automatic speech recognition system, or a 
communication system , 

A flowchart of joint time series and spectrum analysis is 
illustrated in FIG. 5. "Signal" and "noise" levels are 
determined at 500. Here the "signal" is defined as the data 
of interest for the client system, and the "noise" is defined 
as everything else. For example, speech is a signal for a 
client system that performs automatic speech recognition. In 
the illustrated embodiment, the signal detector may be a 
harmonic detector. A signal-to-noise ratio (S/N) is 
calculated from the estimated signal and noise levels. 

The S/N over a period long enough to be representative of 
the overall S/N is estimated at 502. If the amplifier gain is 
found to be too low or too high by the calculation, then a 
feedback signal is sent to the amplifier to adjust the gains 
accordingly at 504 . 

The frequency domain signal may be analyzed to determine 
proper placement of the microphone. For example, if the 
microphone is placed too close to the audio source, "puffing" 
may be detected at 506. This condition is provided to the 
user through a user display. The frequency domain signal may 
be monitored for a low S/N ratio indicating a microphone too 
far from the audio source at 508. The user may be advised to 
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talk louder, or move the microphone closer to the mouth, or 
improve the environment by moving to somewhere less noisy, or 
put on a headset microphone at 510. 

The invention may be implemented in hardware or software, 
or a combination of both (e.g., programmable logic arrays). 

Unless otherwise specified, the algorithms included as part of 
the invention are not inherently related to any particular 
computer or other apparatus. In particular, various general- 
purpose machines may be used with programs written in accordance 
with the teachings herein, or it may be more convenient to 
construct more specialized apparatus to perform the required 
method steps. However, the invention may be implemented in one 
or more computer programs executing on programmable systems each 
comprising at least one processor, at least one data storage 
system (including volatile and non-volatile memory and/or 
storage elements), at least one microphone. The program code is 
executed on the processors to perform the functions described 
herein. 

Each such program may be implemented in any desired 
computer language (including machine, assembly, high level 
procedural, or object oriented programming languages) to 
communicate with a computer system. In any case, the language 
may be a compiled or interpreted language. 
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Each such computer program is preferably stored on a 
storage media or device (e.g., ROM, CD-ROM, or magnetic or 

optical media) readable by a general or special purpose 
programmable computer, for configuring and operating the 
computer when the storage media or device is read by the 
computer to perform the procedures described herein. The 
inventive system may also be considered to be implemented as a 
computer-readable storage medium, configured with a computer 
program, where the storage medium so configured causes a 
computer to operate in a specific and predefined manner to 
perform the functions described herein. 

A number of embodiments of the invention have been 
described. Nevertheless, it will be understood that various 
modifications may be made without departing from the spirit 
and scope of the invention. For example, some of the steps of 
the algorithms may be order independent, and thus may be 
executed in an order other than as described above. Accord- 
ingly, other embodiments are within the scope of the following 
claims . 
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